Profiling Techniques for a Fine-Grained Parallel Language

نویسندگان

  • Chris J. Scheiman
  • Bjoern Haake
  • Klaus E. Schauser
چکیده

Fine tuning the performance of large parallel programs is a very difficult task. A profiling tool can provide detailed insight into the utilization and communication of the different processors, which helps identify performance bottlenecks. In this paper we present two profiling techniques for the fine-grained parallel programming language Split-C, which provides a simple global address space memory model. One profiler provides a detailed analysis of a program’s execution. The other profiler collects cumulative information. As our experience shows, it is much more challenging to profile programs that make use of efficient, low-overhead communication. We incorporated techniques which minimize profiling effects on the running program. We quantify the profiling overhead and present several Split-C applications which show that the profiler is useful in determining performance bottlenecks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Approach for Fine-Grained Profiling of Mesh-Based Parallel Programs

Characterizing the dynamic behavior of parallel programs in terms of their execution profile helps to understand their behavior and suggest optimization strategies to improve the performance. Traditional event tracing techniques write the profiled data to trace files. Using the traditional approach for fine grained profiling not only yields large unwieldy trace files but often also gives skewed...

متن کامل

An improved joint model: POS tagging and dependency parsing

Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...

متن کامل

Load Balancing in the Fine-Grained Object-Oriented Language Ellie

This position paper presents the main goals of the Ellie project which have been to design and implement a general, powerful, fine-grained, object-oriented, programming language intended for machine independent programming of distributed memory parallel computers. The implementation includes operating system facilities for message routing and load balancing. The main subject of this paper is lo...

متن کامل

Synchronization coherence: A transparent hardware mechanism for cache coherence and fine-grained synchronization

The quest to improve performance forces designers to explore finer-grained multiprocessor machines. Ever increasing chip densities based on CMOS improvements fuel research in highly parallel chip multiprocessors with 100s of processing elements. With such increasing levels of parallelism, synchronization is set to become a major performance bottleneck and efficient support for synchronization a...

متن کامل

Evolution of texture in an ultrafine and nano grained magnesium alloy

The evolution of texture was discussed during the formation of ultra-fine and nano grains in a magnesium alloy severely deformed through accumulative back extrusion (ABE). The microstructure and texture obtained after applying multiple deformation passes at temperatures of 100 and 250°C were characterized. The results showed that after single ABE pass at 100°C an ultrafine/nano grained microstr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007